Finding Novel Associations Across Domains Using Linked Data: a Case Study on Genetic Variants Disrupting Transcription Start Sites

Authors

  • Eelke van der Horst
  • Rajaram Kaliyaperumal
  • Zuotian Tatum
  • Mark Thompson
  • Erik Schultes
  • Eleni Mina
  • Ivo Fokkema
  • Johan T. den Dunnen
  • Marco Roos
  • Jeroen F. J. Laros
  • Barend Mons
  • Kristina M. Hettne
  • Peter A. C. 't Hoen
Abstract

With the widespread use of Next Generation Sequencing technologies, the primary bottleneck of genetic research has shifted from data production to data analysis. However, heterogeneous data sets make comparison and integration challenging and time-consuming. Here, we apply a data interoperability approach that provides an unambiguous (machine-readable) description of genomic annotations based on nanopublications. We show that novel associations can be discovered by performing queries that span genomic annotations from the FANTOM5 consortium (a catalogue of transcription start sites (TSS)) and an instance of the Leiden Open Variation Database containing population-based variant frequency data. A correlation was established between the tissue specificity of a TSS and the frequency and size of genetic variants in the genomic region covering the TSS: TSS that are used in many tissues are less tolerant to disruption by insertions and deletions than those specific to just a few tissues.
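The kind of cross-dataset question described in the abstract can be illustrated with a minimal sketch. It is not the nanopublication-based querying the authors used; it is a plain in-memory interval overlap in Python. The class names, fields, coordinates, and frequencies below are all hypothetical toy values chosen for illustration and are not data or results from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TSS:
    """Hypothetical TSS record: a genomic region plus the number of tissues using it."""
    chrom: str
    start: int  # inclusive
    end: int    # inclusive
    n_tissues: int


@dataclass(frozen=True)
class Variant:
    """Hypothetical variant record with a population frequency."""
    chrom: str
    start: int  # inclusive
    end: int    # inclusive
    kind: str   # "snv", "ins" or "del"
    frequency: float


def overlaps(tss: TSS, variant: Variant) -> bool:
    """True when both regions lie on the same chromosome and share at least one base."""
    return (
        tss.chrom == variant.chrom
        and tss.start <= variant.end
        and variant.start <= tss.end
    )


def indel_burden(tss_list, variants):
    """Map each TSS to the summed frequency of insertions/deletions overlapping it."""
    burden = {}
    for tss in tss_list:
        hits = [
            v for v in variants
            if v.kind in ("ins", "del") and overlaps(tss, v)
        ]
        burden[tss] = sum(v.frequency for v in hits)
    return burden


# Toy inputs (invented for illustration only).
TOY_TSS = [
    TSS("chr1", 100, 150, n_tissues=40),
    TSS("chr1", 500, 550, n_tissues=2),
]
TOY_VARIANTS = [
    Variant("chr1", 120, 120, "snv", 0.25),  # not an indel, ignored
    Variant("chr1", 510, 530, "del", 0.25),
    Variant("chr1", 540, 560, "ins", 0.5),
    Variant("chr2", 510, 530, "del", 0.5),   # different chromosome, ignored
]

burden = indel_burden(TOY_TSS, TOY_VARIANTS)
for tss, total in burden.items():
    print(tss.chrom, tss.start, tss.end, tss.n_tissues, total)
```

With real data, the per-TSS burden would then be compared against `n_tissues` to look for the kind of relationship the abstract reports; the nested loop here is quadratic and would need an interval index at genome scale.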

Similar resources

Evolutionary Constraint and Disease Associations of Post-Translational Modification Sites in Human Genomes

Interpreting the impact of human genome variation on phenotype is challenging. The functional effect of protein-coding variants is often predicted using sequence conservation and population frequency data; however, other factors are likely relevant. We hypothesized that variants in protein post-translational modification (PTM) sites contribute to phenotype variation and disease. We analyzed frac...

Systematical analyses of variants in CTCF-binding sites identified a novel lung cancer susceptibility locus among Chinese population

Genetic susceptibility variants identified by genome-wide association studies mostly lie outside protein-coding regions. This suggests that variants located in transcriptional regulatory regions play an important role in carcinogenesis, including lung cancer. In the present study, we systematically investigated the associations between the variants in the binding sites of an extensive tra...

Association of Obesity Related Genetic Variants (FTO and MC4R) with Breast Cancer Risk: a population-based case-control study in Iran

Background: The heterogeneous breast cancer is the most common cause of cancer-related mortality. Obesity defined by BMI is known as a major risk factor for breast cancer. Objective: The purpose of this study was to explore the role of obesity related-polymorphisms rs9939609 FTO and rs17782313 MC4R in breast cancer development. Materials and Methods: We obtained matched peripheral blood, serum ...

An integrated map of genetic variation from 1,092 human genomes

By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methods to integrate information ac...

Mapping of six somatic linker histone H1 variants in human breast cancer cells uncovers specific features of H1.2

Seven linker histone H1 variants are present in human somatic cells with distinct prevalence across cell types. Despite being key structural components of chromatin, it is not known whether the different variants have specific roles in the regulation of nuclear processes or are differentially distributed throughout the genome. Using variant-specific antibodies to H1 and hemagglutinin (HA)-tagge...

Publication date: 2015